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Abstract 

In this paper, we establish a determinantal formula for 2x2 matrix commuta- 
0^ | tors [X, Y] = XY — YX over a commutative ring, using (among other invariants) 

the quantum traces of X and Y. Special forms of this determinantal formula 
include a "trace version" , and a "supertrace version" . Some applications of these 
formulas are given to the study of value sets of binary quadratic forms, the fac- 
torization of 2 x 2 integral matrices, and the solution of certain simultaneous 
^ . diophantine equations over commutative rings. 

§1. Introduction 

Throughout this paper, the word "commutator" is taken to mean an additive com- 
mutator [X, Y] := XY — YX , where X, Y are elements in an (associative) ring R. The 
case of special interest to us in this paper is when R is a matrix ring M n (5*), where S 
\ is a commutative ring (with identity). 
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In the case S is a field (or a division ring) , several papers in the literature have dealt 
with the theme of computing the possible ranks of [X, Y] , when X is a given matrix. 
An important special case of this is to determine when [X, Y] can achieve full rank for a 
suitable Y; see, e.g. [GL] and [Sa]. More recently, two of the present authors have taken 
up the study of similar themes when the division ring S is replaced by a more general 
ring. Here, it is no longer possible to use effectively the notion of matrix ranks. But the 
classical full-rank case corresponds to the invertibility of the commutator [X, Y] , which 
is certainly a condition of interest and significance; see [KL 1; KL 2 ]. 

Over a commutative base ring S, a matrix M £ M n (5') is invertible iff det (M) is a 
unit in S. Thus, deciding the invertibility of [X, Y] rests on understanding the behavior 
of its determinant det [X, Y]. However, there seems to be no general formula available 
in the literature for the computation of such an n x n determinant. It is much more 
feasible to find nice formulas in the 2x2 case, since 2x2 determinants are so much 
easier to compute. Indeed, by using the Cayley-Hamilton theorem, or using classical 
adjoints, several 2x2 formulas can be written down for det [X, Y]. For a quick survey 
of this, see §2. These formulas often involve higher powers of X and Y; unfortunately, 
this feature tends to greatly limit their applicability, for instance, to the study of the 
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invertibility of [X, Y]. Optimally, we should hope to write down formulas expressing 
det [X, Y] in terms of quantities (e.g. traces and determinants) directly associated with 
the matrices X, Y, XY, and if necessary YX. 

In the beginning phase of our work, we were aware of the following "trace version" of 
a 2 x 2 determinantal formula for det [X, Y] in the context of P. I. -theory and invariant 
theory: if X, Y e M 2 (S) have traces t, t' and determinants 5, 5', then 

(1.1) det [X,Y] =A5'5-(ti(XY)) 2 -5t' 2 -5't 2 + ti(XY)t't. 

While this formula is a natural consequence of the invariant theory of 2 x 2 matrices 
(under simultaneous conjugation), it turned out to be not the most suitable for the 
applications we have in mind. In the search for a better alternative (for our purposes), 
we stumbled upon a "super-trace version" of a determinantal formula, using the "super- 
trace" of 2 x 2 matrices M = (m^), which is defined to be str (M) = mu — ^22 £ S. 
(The supertrace terminology and its concomitant notation "str" come from the theory 
of super- algebras.) If X, Y e M 2 (S') have determinants 8, 5' and supertraces r, r', the 
"supertrace version" of the commutator determinantal formula states the following: 

(1.2) -det [X,Y] =5r' 2 + 5'r 2 + tr(Xr)rV-str(XF)str(FX). 

Contrary to the case of (1.1), the existence of (1.2) does not seem to be predictable by 
invariant theory since the supertrace of a matrix in M 2 (S') is not a similarity invariant. 
Thus, in a manner of speaking, the existence of the formula (1.2) is a bit surprising. 

While the two formulas (1.1) and (1.2) look substantially different (having for in- 
stance a different number of terms), they do share some common features. This begs the 
question whether they are special cases of one single more general formula. In the world 
of quantum mathematics, it is easy to speculate what might be the case. If q is a fixed 
element in the ring S, there is a well-known notion of a q-trace for M = (m^) G M 2 (S'), 
defined by tr g (M) = m u + gm 22 . For q = ±1, this g-trace retrieves the trace and the 
supertrace respectively. The existence of (1.1) and (1.2) would seem to strongly sug- 
gest that there is a hybrid "quantum version" of a determinantal formula for det [X, Y], 
which would specialize to (1.1) when q — 1, and to (1.2) when q = —1. 

In §4 of this paper, we prove that this is indeed the case. The main result is given 
in Theorem 4. 1 which, for any given q, expresses q ■ det [X, Y] in terms of various 
quantities, including the g-traces of X, Y, XY, and YX. This theorem is preceded by 
Theorem 3.4 in §3, which treats the special case of g-traceless matrices X, Y. 

Some applications of the determinantal formulas are given in Sections 5-6. In §5, 
we present a new characteristic-free treatment of a norm theorem of Taussky [Tai, Ta 2 ] 
for quadratic field extensions, and give an explicit generic version of this result for 
commutative rings. This is followed by a last section (§6) devoted to the further study 
of the value sets of binary quadratic forms, and factorization questions on 2 x 2 matrices 
over rings. For given elements p,q in a commutative ring S, we show in Theorem 6.3 
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that a non O-divisor c E S can be written in the form pr +qs for some r,s G S iff the 
( cq\ 

matrix I ^ J can be factored into XY such that det (X) = cp, det (Y) = cq, and 

det [X, Y] = —c 2 . Using this factorization theorem, we prove a result (Thm. 6.8) on affine 
curves over rings which implies that, for any element c G S representable in the form 
pr 2 + qs 2 , there exist x,y,z G S such that px + qy = —c and xy — z 2 = — c 2 . Further 
applications of the determinantal formulas to the study of invertible commutators in 
matrix rings can be found in the forthcoming work [KLi]. 

§2. Preliminary Determinantal Formulas 

For the rest of this paper, R denotes the matrix ring M 2 (S'), where S is a fixed 
commutative ring. In this section, we put together a few general (mostly known) facts 
pertaining to the computation of the determinant of a 2 x 2 matrix commutator [X, Y] = 
XY — YX, where X, Y G R. This quick survey will pave the way to our more detailed 
investigations on det [X, Y] in the ensuing sections. 

To begin with, we note that, since [X, Y] is a 2 x 2 traceless matrix for all X,Y G R, 
the Cayley-Hamilton Theorem implies that 

(2.1) [X,Y] 2 = -det[X,Y}-I 2 . 

Thus, computing det [X, Y] is tantamount to computing the scalar matrix — [X, Y] 2 . 
This observation leads to the first available formula for det [X, Y] . 

Proposition 2.2. For any X,Y G R, det [X, Y] = tr (X 2 Y 2 ) - tr ((XY) 2 ). 

Proof. We may assume that S is the free commutative Z-algebra generated by the 
eight entries of X and Y. Computing the trace on the matrices in (2.1), we have 

(2.3) 2 det [X, Y] = - tr ([X, Y] 2 ) = - tr (XYXY + YXYX - XYYX - YXXY) . 

On the RHS, X (YXY) and (YXY) X have the same trace, and (XY) (YX) and 
(YX) (XY) have the same trace. Therefore, the RHS of (2.3) can be rewritten as 

-2 tr ((XY) 2 ) + 2 tr (XYYX) = -2 tr ((XY) 2 ) + 2 tr (X 2 Y 2 ) . 

Cancelling the factors of 2 gives the desired result. □ 

Next, we'll prove a determinantal formula involving the adjoint matrix. For a 2 x 2 
a b\ „ .„ ., „, f d —b^ 



matrix M = ^ G R, we'll write M' — ^ j for the classical adjoint of M. 

For E = , we have the relation M' = EM T E~ 1 , as was noted in [GK: (2.4)(3)]. 

This implies, in particular, that (MN)' = N'M' for M,N G R. The following lemma 
is a useful observation on the adjoint. 
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Lemma 2.4. For any matrices A, B G R = M 2 (S), we have 
(2.5) det (A - B) = det (A) + det (B) - tr {AB'). 



Proof. For any N e R, we have N ■ N' = (det N) ■ I 2 , and N + N' = (tr N) ■ I 2 . 
Applying these facts to the equations 

{A- B)(A- B)' = A A' + BB' - AB' - B A' 

= A A' + BB 1 - (AB' + (AB')'), 

we arrive at the desired equation. □ 

Proposition 2.6. For any matrices X,Y G R, we have 

(2.7) det [X, Y] = 2 (det X) (det Y) - tr (XYX'Y'). 



Proof. In the formula (2.5), let A = XY and B = YX. Then the LHS of (2.5) is 
det [X,Y], and its RHS is 

det (XY) + det (YX) - tr ((XY) (YX)') = 2 (detX) (detY) - tr (XYX'Y'). □ 

The formulas in (2.2) and (2.7) are interesting, but are not very suitable for prac- 
tical computations since their RHS's involve new matrices such as (XY) 2 , X 2 Y 2 , and 
XYX'Y'. Ideally, we would like to have formulas that express det [X,Y] in terms of 
quantities naturally associated with the matrices X, Y, XY, and YX. Such formulas 
will be obtained in the next two sections. Nevertheless, combining the two determinantal 
formulas obtained so far leads to the following curious trace identity. 

Corollary 2.8. For X,Y G R, (tr(XF)) 2 = tr (X 2 Y 2 ) + tr (XYX'Y 1 ). 

Proof. Equating the two expressions for det [X, Y] in (2.2) and (2.7), we get 

(2.9) tr (X 2 Y 2 ) - tr ((XY) 2 ) = 2 (det X) (det Y) - tr (XYX'Y'). 

Using the well-known identity 2 det (M) = (tr(M)) 2 - tr(M 2 ) for M = XY on 
the RHS, we can cancel the terms — tr ((XY) 2 ) from both sides of (2.9). After this, 
transposition yields the desired result. □ 



§3. Determinantal Formula: The q-Traceless Case 

In this section, we shall embark upon the task of finding a determinantal formula for 
det [X, Y] that is suitable for the applications we have in mind. At the beginning stage 
of our work, we had at our disposal both a "trace version" and a "supertrace version" 
of such a formula. This suggested to us that there is perhaps a "g-trace version" of the 
formula, which, for q — 1 and q — — 1 respectively, would specialize to the trace version 
and the supertrace version. Such a quantum-trace version would then provide some kind 
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of a "homotopy" from the trace version to the supertrace version, and conversely. In this 
and the following section, we shall begin the work to derive such a g-trace determinantal 
formula for det [X, Y\. Throughout these two sections, the element q G S will be regarded 
as fixed. Recall that, for any matrix M = (m^) G M^S"), the g-trace tr ? (M) is defined 
to be mn + <l m 22 G S. 

The strategy of our approach is to first find a g-trace determinantal formula in the 
case where X, Y are both q-traceless; that is, where tr q (X) = tr q (Y) = 0. To guess 
how such a formula might look like, it would be a good idea to start with the case of 
the ordinary trace; that is, where q — 1. Here, a formula for det [X, Y\ can be derived 
from the one in Proposition 2.2; see, e.g. the proof of [BBO: Lemma 6]. 

Proposition 3.1. Let X,Y G R = M 2 (S) be such that tr (X) = tr (Y) = 0. Then 

(3.2) det [X, Y] = 4 det (XY) - (tr (XY)) 2 . 

Proof. It is of interest to observe that the RHS of (3.2) is just the negative of the usual 
discriminant of the quadratic characteristic polynomial of the product matrix XY. Let 
5 = det(X) and 5' = det (Y) (so that det(XY) = S'S). Since X,Y are traceless, 
Cayley-Hamilton gives X 2 = —5I 2 and Y 2 = —S'I 2 . Thus, using (2.2) and the identity 
for 2 det (M) in the proof of (2.8), we have 

det [X,Y] =tr (5'5I 2 ) - tr ((XY) 2 ) =25' 8- (tr (XY)) 2 + 2 det (XY), 

which simplifies to the RHS of (3.2). □ 

Remark 3.3. We could have also proved (3.2) by applying the methods of Procesi in 
[Pri]. From Procesi's approach (see [KP: §2.4, p. 21]), it will be enough to prove (3.2) in 
the case where the first traceless matrix X is diagonal. In this case, (3.2) can be easily 
checked by a direct computation of both sides of the equation. 

To generalize Proposition 3.1 to the case of g-traces, we'll use the following standard 
notation in quantum computations: with q G S fixed, we write 

[n] = [n] q = (q n - l)/(q - 1) = q n ~ l + ■ ■ ■ + q + 1 G S (VnG Z+). 

In fact, we'll use this notation only for n = 2; namely, [2] = 1 + q. Treating this as 
a "quantum 2", we could use [2] 2 to replace the factor 4 in the formula (3.2). Also, 
for g-traces, tr q (XY) and tr q (YX) may no longer be the same, so it would be wise to 
replace the term (tr(XF)) 2 by tr q (XY)tr q (YX). Fortuitously, these "replacements" 
turn out to give the following correct generalization of Proposition 3.1. 

Theorem 3.4. Let X,Y G R = M 2 (S) be such that tr q (X) = tr q (Y) = 0. Then 

(3.5) q-det[X,Y] = [2] 2 det (XY) - tr q (XY) tr q (YX). 

In this formula, the first term on the RHS can also be replaced by tr q (X 2 ) tr q (Y 2 ). 
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Proof. To prove the last statement, we note again that Cayley-Hamilton implies X 2 = 
tX-det (X) I 2 . Thus, taking g-traces gives tr q (X 2 ) = -[2]-det(X). Multiplying this 
by a similar equation for Y, we see that tr q (X 2 ) ti q (Y 2 ) = [2] 2 det (XY). 

To prove (3.5), let X — ^ ^ , and Y = ^ ^ j^j . By direct computation, 

[X, Y] has the form ( j 2 ] r ' k J ' w ^ ere k = bg — cf, r = bh — df, and r' = dg — ch. 
Therefore, the LHS of (3.5) is 

(3.6) q-det[X,Y] = -q ■ ([2] 2 rV + k 2 ). 

To compute the RHS of (3.5), note that, in terms of descending powers of [2] : 

(3.7) tr q (XY) = q 2 dh + q (cf + dh) + bg = [ 2 ] 2 dh + [2] s + k, where s = cf - dh. 

Similarly, we have tr q (YX) = [ 2 ] 2 dh + [ 2 ] s' — k, where s' — bg — dh. (For later use, 
note that s' — s — k.) On the other hand, 

(3.8) det(XF) = {qd 2 + be) {qh 2 + fg) = ( [ 2 ] d 2 + be - d 2 ) ( [ 2 ] h 2 + fg - h 2 ) . 

Using (3.7) and (3.8), we can expand the RHS of (3.5) in the form J^ =0 [2] 1 By 
quick inspection, = 0, ao = k 2 , and a\ = sk — s'k = (s — s r ) k = —k 2 . Also, by using 
the definition of s and s', we compute easily that 

(3.9) a 2 = (bc-d 2 ){fg-h 2 )-s's = {bh-df){dg-ch)=r'r, and 

(3.10) a 3 = d 2 (fg - h 2 ) + h 2 (bc - d 2 ) -dh(s + s') = (df - bh) (dg - ch) = -r'r. 

With these computations of the dj's, the RHS of (3.5) becomes 

-[2]Vr+ [2] 2 rV- [2]k 2 + k 2 = -[ 2 ] 2 r V ([ 2 ] - l) - k 2 ([ 2 ] - l) 

= -q- ([2]Vr + A; 2 ), 

which is precisely the LHS of (3.5) as computed in (3.6). □ 

Note that, while Thm. 3.4 generalizes Prop. 3.1, the proof of the former is indepen- 
dent of that of the latter. Thus, in a mathematical sense, we could have completely 
dispensed with Prop. 3.1. However, this Proposition has clearly played an important 
role in the discovery (and formulation) of the formula (3.5), so we have included it for 
motivational reasons. In this regard, one might hope that there is also a shorter (or 
quicker) proof for (3.5) based on using a similar reduction (as in Remark 3.3) to the 
case where X is diagonal. Unfortunately, the g-trace of a matrix is not invariant under 
(ordinary) matrix conjugation, so the standard results in the invariant theory of matri- 
ces (as developed in [Pr 1; Pr 2 ]; see also [Fo] and [KP]) does not apply directly to the 
setting of Thm. 3.4. It is conceivable that some suitable form of a "quantum invariant 
theory" (based on an appropriate notion of quantum conjugation) might enable us to 
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make the above reduction. However, as far as we know, such a quantum invariant theory 
of matrices is not yet available. 

§4. Quantum- Trace Determinantal Formula 

After our preliminary investigations on the case of g-traceless matrices in §3, we are 
now in a good position to give the full statement for the quantum-trace determinantal 
formula. With respect to a fixed element q G S, this formula expresses q ■ det [X, Y] in 
terms of various quantities (including the g-traces) associated with the matrices X, Y 
and XY, YX, grouped in descending powers of [2] := 1 + q e S. If q e S happens 
to be a unit (e.g. a root of unity), we can then invert q and get an equation just for 
det [X, Y] . The full statement is as follows. 

Theorem 4.1. Let X, Y E R = M. 2 (S), with determinants 5, 5', traces t, t' , and 
q-traces r, r' . Also, let a = tr q (XY), and a' = tr q (YX). Then 

q • det [X, Y] = [ 2 ] 2 5' 5 - [ 2 ] (5 1 V ' + S't r) 

+{5t' 2 + 5't 2 + tr (XY) r'r - a' a). 

Here, in the last parenthetical expression on the RHS, the first three terms constitute a 
quadratic form in r' and r, with coefficients 5, 5', and tr(XY). 

Before we proceed, let us first give the appropriate interpretations of the above 
formula in the two important special cases where q — ±1. This is conceptually an 
important step, since Theorem 4.1 could hardly have come into existence without having 
had these two crucial special cases as its precursors. 

We first consider the case q — 1. Here, r = t, r' — t f , and a — a' — tr(XY). 
(These are all ordinary traces.) In this case, after a simple combination of terms, (4.2) 
simplifies to the following trace version of the determinantal formula : 

(4.3) det [X,Y] = 4 5'5 - ( tr (XY) f - 5t' 2 - 5' t 2 + tr (XY) ft . 

Remark 4.4. (A) As in the general case, the last three terms in the formula above 
constitute a quadratic form in t' and t, with coefficients —5, —5', and tr(XY). In the 
traceless case, this quadratic form drops out, and the formula boils down to (3.2). 

(B) Some more special cases of (4.3) are also worth noting. For instance, if tr (XY) = 0, 
(4.3) gives det [X,Y] = A5'5 - 5t' 2 - 5't 2 . On the other hand, if S = 5' = 0, (4.3) 
implies that tr (XY) divides det [X, Y] in the ring S. 

From the viewpoint of Procesi's papers [Pr l5 Pr 2 ] (see also [KP: §2.4]), det [X,Y] 
should be expressible (in case 2 e U(S*)) as a polynomial in the traces and determinants 
of X,Y, along with tv(XY). Thus, the existence of the formula (4.3) is entirely to be 
expected. In the invariant theory of 2 x 2 matrices, the two sides of the equation (4.3) 
represent the Formanek element associated with X and Y; see, for instance, [JLS]. Note 
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that, although the Formanek element in [JLS: §3.3] is expressed with the integer 4 in 
the denominator, this "4" is eventually "cancelled out" to give the RHS of (4.3). We can, 
of course, also get a formula for det [X, Y] using trace elements alone, by expressing all 
determinants in (4.3) in terms of traces via the formula 2 det (M) = (tr (M)) 2 — tr (M 2 ) 
(for every M e M 2 (S')). This determinant elimination process results in the following 
equation: 

det [X, F] = tr (X 2 ) tr (Y 2 ) - (tr (XY)) 2 - tl ( X2 ) t ' 2 + tl ( y2 )* 2 + tr {xy) ^ 

This time, the denominator "2" is no longer avoidable, so this formula would be mean- 
ingful only over the rings S in which 2 is invertible. It is, in retrospect, rather fortunate 
that the formula (4.3) is applicable to all commutative rings S. 

Next, we consider the case q = — 1 in Theorem 4.1. Here, for any M = (iriij) e 
M 2 (S'), tT q (M) is the supertrace str (M) := mu — rn 2 2 defined in the Introduction. 
Thus, r = str(X), r' = str(F), and a = str (XT), a' = str (FX). Now for q = -1, 
we have [2] = 1 + (—1) = 0, so all positive powers of [2] can be dropped! This leads 
to the following remarkably simple supertrace version of (4.2): 

(4.5) -det [X,Y] = 5t' 2 + 5' r 2 + tr (XY) t't - str (XY) str (FX). 

Here, as in Thm. 4.1, the first three terms constitute a quadratic form in r' and r, 
with coefficients 5, 5', and tr(XF). In (4.5), we have chosen to keep the (— l)-factor 
on the LHS, to remind ourselves of the fact that this LHS is really q ■ det [X, Y\. 

Remark 4.6. There are several ways to extend the definition of "str" to higher matrix 
algebras. For instance, one may define "str" on T = M 2n (S') by thinking of any X e 
T as a 2x2 block matrix with four nx n blocks, and taking str (X) to be 

tr (xn) — tr (^22) £ S. With this particular definition of "str", the formula (4.5) would 
remain meaningful. Unfortunately, it will no longer be true if n > 1. In fact, let 
X = Y = diag(/ n ,0 n ). Then [X,Y] = 0, so the LHS of (4.5) is zero. Now det (X) = 
det (Y) = 0, tr (XY) = n, and each of X, Y, XY, YX has also supertrace n. Thus, 
the RHS of (4.5) is n 3 — n 2 = n 2 (n — 1). So in this example, (4.5) holds if and only if 
n = 1 (assuming, say, char (S) = 0). Exactly the same remark could have been made 
about the trace version of the determinantal formula in (4.3). 

The following are some easy consequences of the supertrace formula (4.5). 

Corollary 4.7. Keep the notations used in the formula (4.5). 

(1) // XY has a zero diagonal, then —det [X,Y] = 5t' 2 + 5' r 2 . 

(2) If X has a constant diagonal, then —det [X, Y\ = 5t' 2 — str (XY) str (FX). 

(3) // XY = YX, then (str (XY)) 2 = 5t' 2 + 5't 2 + tr (XY) t't. 

Corollary 4.8. For any a, b,c,de S, we have 

(4.9) (ac - bd) 2 = ab (c - d) 2 + cd(a- b) 2 + (ac + bd) (a -b)(c-d). 
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Alternatively, if we write r = a — b and r' = c — d, then 

(4.10) (cr + br') 2 = abr' 2 + cdr 2 + (ac + bd) t't. 

Proof. The universal quartic identity (4.9) is just the result (4.7) (3), applied to a pair 
of diagonal matrices X = diag (a, b) and Y = diag(c, d). The alternative form (4.10) 
follows from (4.9) upon noting that c r + b r ' = c (a — b) + b (c — d) = ac — bd. □ 

The quaternary quartic identity (4.9) does not seem well known. A search of the 
literature and standard websites such as [Pi] did not turn up this curious algebraic 
identity. Of course, there is also a more sophisticated "g- version" of (4.9) (in five 
variables, including q), obtained by writing down the quantum-trace determinantal 
formula (4.2) for the diagonal matrices X and Y in the proof above. Unlike (4.9), 
however, this quinary identity is no longer homogeneous. 

Corollary 4.11. Assume S is a field, and keep the notations used in the formula (4.5). 
Then [X,Y] £ GL 2 (S) iff 5 r' 2 + 5' t 2 + tr (XY) t't = str (XY) str (YX). Also, X is 
a scalar matrix iff this equation holds for all Y G M^S*). 

Proof. The first statement follows from (4.5), since det [X,Y] = iff [X, Y] <£ GL 2 (S). 
For the second statement, the "only if" part is clear, and the "if" part follows from the 
easy fact (see [GL], or [KL 2 : (5.14)]) that, for any non-scalar matrix X (over a field S), 
there exists some Y E M 2 (S) such that [X, Y] e GL 2 (S). □ 

We shall now begin to work toward the proof of Thm. 4.1. Of course, once the 
determinantal formula (4.2) is written down explicitly, a direct check on Singular or 
Macaulay2 will instantly confirm that it is a universal identity for polynomials in nine 
commuting variables (the symbol q together with the eight entries of X and Y). How- 
ever, such a machine checking exercise would reveal no reason whatsoever for the truth 
of the formula. In view of this, we feel it still imperative to give a detailed conventional 
mathematical proof for the formula (4.2). Our proof is preceded by the following lemma. 

Lemma 4.12. For X, Y e M 2 (S') with notations as in Theorem 4.1, we have 

(4.13) a + a' -fr-tr' = [2] (tr (XY) - t't ). 

Proof. By working generically, we may assume, as in the proof of Prop. 2.2, that 2 is 
invertible in S. In this case, we can write det X = [ (tr X) 2 - tr (X 2 )]/2. Substituting 
this into the Cayley-Hamilton equation X 2 — (tr X) X + (det X) I 2 = 0, and polarizing 
the resulting equation (via X h-» X + Y), we get 

XY + YX — (tr Y)X- (tr X) Y + (tr X) (tr Y) I 2 - tr (XY) I 2 = 0. 

Taking g-traces on both sides gives the desired equation (4.13). □ 

Before proceeding to the proof of Theorem 4.1, we record a couple of consequences 
of the Lemma above. 
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Corollary 4.14. (1) For any traceless matrices X, Y G M 2 (S'), we have 

(4.15) tr q (XY) +tr q (YX) = [2]tr(XF). 
(2) For any matrices X, Y G M. 2 (S), we have 

(4.16) str (XY) + str (FX) = tr (X) str (Y) + tr (Y) str (X) . 

In particular, str (FX) = —str (XY) if X has a zero diagonal, or X, Y are both trace- 
less, or X, Y are both supertraceless. 

Proof. (1) is obtained from Lemma 4.12 by setting t — t' — 0. [In this special case, 
(4.15) expresses the fact that the ordinary trace tr (XY) is a "quantum average" of the 
quantum traces tr q (XY) and tr q (YX) (for any given q G S).] 

(2) is obtained by specializing Lemma 4.12 to the case q = —1, where the g-trace 
becomes the supertrace. [In addition, we can check easily that both sides of (4.16) are 
equal to 2 str (X * Y), where X * Y is the Hadamard product of X and Y (obtained 
by "entry- wise multiplication" of the two matrices).] □ 

We have now all the necessary tools with which to verify our formula (4.2). 

Proof of Theorem 4.1. To begin with, note that if the two matrices X, Y are both 
g-traceless (that is, r = r' = 0), then all terms on the RHS of the determinantal 
formula (4.2) drop out — except the first and the last terms. In this case then, we 
know that (4.2) holds, thanks to Thm. 3.4. To prove (4.2) in general, we should then 
try to make a reduction to the q-traceless case. As before, we may assume that S is 
the free commutative Q-algebra generated by q and the eight entries of X and Y. In 
particular, [2] _1 exists in the quotient field of the integral domain S. To make the 
desired reduction, let X = X — [2] _1 r/ 2 and Y = Y — [2]~ 1 t'I 2 . These have q- 
traces zero since r, r' were the g-traces of X and Y, and obviously [X, Y\ = [X ,F ]- 
Therefore, by Thm. 3.4, we have 

(4.17) q ■ det [X, Y\ = [2] 2 det (X F ) - tr ? (X F ) tr q (Y X ). 

Our job now is to compute the RHS of (4.17) in terms of the various quantities associated 
with X, Y, XY, and YX. Taking g-traces on the equation 

(4.18) X Y = XY- [2]- 1 (r'X + rF) + [2]- 2 r'r/ 2 , 

we see that tr g (X Fo) = °~ — [2] _1 r / r where a = tr q (XY), and similarly, tr q (Y X ) = 
(j / -[2]- 1 r'r, where a' = tr q (YX). On the other hand, since det (X-A/ 2 ) = \ 2 -t\+5 
for any parameter A G S (notations as in Thm. 4.1), we have, for A = [2] _1 r: 

det (X ) = [ 2 ] - V - [ 2 ] -H r + 5 ; similarly, det (Y ) = [ 2 ] ~ 2 r ' 2 - [ 2 ] ~H V ' + 5 '. 

Substituting all of these expressions into the RHS of (4.17), we get 

g-det[X,F] = ([2]5-tr+[2]-V 2 )([2]5'-t'r'+[2]-V' 2 ) 
(4 ' 19) _( ff _[ 2 ]-VV)(,'-[2]-Vr). 
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Expanding the RHS formally into J2f=-2 we have clearly 5_ 2 = 0, b 2 = S'S, and 

bi = —{St'T 1 + 8'tr). Thus, [2] 2 6 2 + [2] b\ already produces the first two groups of 
terms on the RHS of the formula (4.2). The remaining terms on the RHS of (4.19) are 

6 + [2]- 1 6_ 1 = (5t' 2 + 5'r 2 + t'tr'r-a'a) + [2}- 1 r'r(a + a' -t'r-tr') 
= (5T' 2 + 5'T 2 + t'tr'r-a'a)+r'r(tT(XY)-t't) 

in view of Lemma 4.12. After cancelling the two t'tr'r terms, we get precisely the last 
group of terms in the desired determinantal formula (4.2) ! □ 

Remark 4.20. Of course, proving the quantum-trace version of the determinantal for- 
mula (4.2) in one stroke for all q makes it unnecessary, for instance, to handle separately 
the cases q — 1 and q — — 1. But more discerningly, working directly in the quantum- 
trace case actually makes the proof of (4.2) easier as it enables us to "manage" many 
terms at once by organizing (and simplifying) them in "[2]-adic expansions" (as in the 
proof above). The same proof, written out in the special cases q = 1 or q — — 1 would 
look harder and more confusing since the pattern of the [2]-adic expansions would no 
longer be apparent. The same remark could have been made about the proof of Thm. 3.4. 

§5. Relations to Binary Quadratic Forms 

The last two sections of this paper are devoted to some applications of the two de- 
terminantal formulas obtained in (4.3) and (4.5). The first applications, given in this 
section, offer a characteristic-free generalization of a theorem of Olga Taussky relat- 
ing the determinants of 2 x 2 commutators of integral matrices to norms in quadratic 
extensions of Q, and some extensions of this theorem to the setting of matrices over 
commutative rings. 

In [Tax], Taussky showed that, if X,Y G M 2 (Z) and an eigenvalue uj of X is 
irrational, then — det [X, Y] is a norm from the quadratic number field Q(oj). A 
converse of this theorem was obtained in [Ta 2 ], where Taussky proved that, if n e Q isa 
norm from a quadratic number field K, then n = —det [X, Y] for some X, Y e M 2 (Q) 
such that X has its eigenvalues in K . Although Taussky assumed that X, Y were 
integral matrices in the first theorem above, this assumption was not needed, so both of 
her theorems may be thought of as results on rational matrices. Actually, the use of the 
rational field Q is also not crucial, so one may try to further replace Q by a field F. 
However, Taussky's proofs in [Tai, Ta 2 ] (and even her later proof using cyclic algebras 
in [Ta 3 ]) assumed implicitly that char (F) ^ 2, and did not apply to all fields. 

In the first half of this section, we shall present a new view of both of Taussky's 
results, formulating them as a "commutator characterization" for the norm elements 
under any quadratic field extension K/F (separable or otherwise). Here, we are able 
to give a rather short proof (motivated by the determinantal formula (4.3)) that works 
uniformly in all characteristics, and is completely within the realm of matrix theory 
(independently of the splitting criterion for cyclic algebras used in [Tas]). Furthermore, 
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the proofs of the "if" part and the "only if" part below are based essentially on one 
single argument, and the "if" part will, later in the section, lead to a constructive 
generic version of the same result for commutative rings. 

Taussky's Norm Theorem 5.1. Let K/F be any quadratic extension of fields of any 
characteristic. Then an element n G F is a norm from K iff n = — det [X, Y] for 
some I,y 6 M 2 (-F) such that K is the splitting field of the characteristic polynomial 
of X. 

Proof. We'll first prove the harder "if" part. Given n = —det [X, Y] as in the 
theorem, let t = tr (X), 5 = det (X), and let u be an eigenvalue of X. By assumption, 
K = F(u), so the minimal polynomial for u over F is f(X) = A 2 — tX + 5 (the 
characteristic polynomial of X). With respect to the F-basis {l,w} on K, the norm 
form of Kj F is easily computed to b^j] 

(5.2) N (x, y) := N K / F (x + y tu) = x 2 + y (tx + 5y), where x,y E F. 

Since /(A) is irreducible over F, we may assume (after a conjugation in M 2 (-F)) that X 

'0 -, 
1 t 

matrix from Y (which does not change [X, Y ] ) , we may assume that Y = 
(for some e, f,g G F). Since tr (XY) = f — 5 g, the trace formula (4.3) yields 



is in its rational canonical form; that is, X = ( ^ f ] . Also, after subtracting a scalar 

e / 
9 



-det[X,Y] = 4Sfg+(f-5g) 2 + 6e 2 -fgt 2 -(f-6g)te 
= (f + Sg) 2 + (e + tg)(5e-tf). 

Letting x := — (/ + 5 g) and y := e + tg, we get —det [X, Y] = x 2 + y (5 e — tf). 
(Of course, this negative determinant could also have been gotten from the supertrace 
formula (4.5), or even from a direct determinant computation^) Noting that 

(5.3) tx + 5y = -t(f + 5g) + 5 (e + tg) = Se-tf, 

we conclude from (5.2) that n = -det [X, Y] = N (x,y) G N K/F (K). 

The converse is now easy! Indeed, we can completely bypass the work in [Ta2], and 

simply "reverse" the above argument to get what we want, as follows. Let n G F be 

a norm from K. Write K = F(u) for a primitive element u, and let A 2 — t A + 5 be 

the minimal polynomial of u over F. Then n = Nk/f{% +2/^) fo r some x,y G F. 

fO —8\ fy —x\ 

Defining X := I ^ J and Y = I J, the computation in the last paragraph 

(with g = 0, / = —x, and e = y) gives —det [X, Y] = Nk/f(x + y u) = n. Of course, 
the splitting field of the characteristic polynomial of X is just K. □ 



1 If we had used {1, -co} as basis instead, the norm form would have been x 2 — t xy + 5 y 2 , which is 
precisely the homogenization of the characteristic polynomial of X. 

2 In fact, if we use the supertrace formula (4.5), x — — (/ + S g) will show up naturally as str (XY), 
and we will have str (YX) = f + 5 g = —x, as is also predicted by the formula (4.16). 
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Remark 5.4. The following observation on the "if" part of Theorem 5.1 is in order. Let 
F — Q, and assume (as in [Tai]) that X, Y e M 2 (Z). If X is in its rational canonical 

form ^ ^ , the proof of the "if" part above shows that — det [X, Y] is, in fact, the 

norm of an algebraic integer in the quadratic field K. However, in general, this need 
not be the case, as was pointed out by Taussky in [Ta 2 : p. 1]. For an explicit example, 

take X — ( ^ ^) and Y :— ( \ ^ J. Here, the quadratic field K in question is 



-2 1] ' ^3 / 

Q (y/— 31 ), and —det [X, Y\ = 419 is not the norm of an algebraic integer from K, 
since x 2 + 31 y 2 = 2 2 • 419 = 1676 has no solution in Z. (Of course, X is only "close" 
- but not equal — to its rational canonical form!) Nevertheless, in confirmation of the 
"if" part of Thm. 5.1, 4 2 • 419 = 6704 = x 2 + 31y 2 is solved by (x,y) = (±77, ±5), 
so 419 = Njr/ Q (a) for a = (±77 ± 5 v /= 3T)/4 in Q( V /= 3T). For more information on 
this example, see Remark 5.15(B) below. 

Since (5.1) was formulated as a field-theoretic theorem, a natural question to ask 
would be whether something in a similar spirit can be said about commutative rings. 
Note that the proof for the "if" part of Theorem 5.1 does not extend to rings, since we 
can no longer apply the standard linear algebra theorem on rational canonical forms. 
Of course, the field-theoretic theorem, applied to the quotient field of a suitable generic 
ring, would give an existential norm formula on —det [X, Y] for a pair of generic ma- 
trices X, Y. However, this formula would involve an unknown "denominator" factor, 
which would make it only a strictly formal result. A useful "ring-theoretic version" of 
Theorem 5.1 should thus be one that gives an implementation of such a formula, with 
explicit information on the denominator factor. Such a version will be given (for any 
commutative ring) in Theorem 5.10 below, where we'll show that the denominator fac- 
tor can actually be taken to be either one of the off-diagonal entries of the matrix X . 
The proof of this theorem is based on a further exploitation of the explicit determinant 
computation of commutators in the proof of the "if" part of Theorem 5.1. 

As in the earlier sections, S will continue to denote a commutative ring. Instead of 
working with the norms from various degree 2 extensions of S, we now choose to work 
directly with binary quadratic forms over S. For any s,t,5 e S, let us denote the "value 
set" (over S) of the quadratic form s x 2 + t xy + 5 y 2 by V[s,t, 5]; that is, 

(5.5) V[s,t,8] := {sr 2 1 +tr 1 r 2 + 5r 2 2 : r 1 ,r 2 eS}. 

In the case where t — 0, we'll simply write V[s, 5] for V[s, 0, 5]. Over the ring of 
integers, of course, the study of these value sets is an important and time-honored 
enterprise that goes back to the classical work of Fermat, Euler, Lagrange, Legendre, 
and Gauss. In the rest of this section, we'll work over a commutative ring S in the case 
s — 1, since x 2 + t xy + 5 y 2 arises precisely as a norm form of the quadratic ^-algebra 
S [ A]/(A 2 — t A + 5). Before coming to the ring-theoretic version of Thm. 5.1, we first 
recall the following elementary result on the value sets V[ 1, t, 8\ over S. A short proof 
is included for the reader's convenience. 
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Proposition 5.6. For any t, 5 G S and A := t 2 — A5, we have the following inclusions: 

(5.7) AV[l,t,5]CV[l,-A]CV[l,t,5]. 
If 2 is invertible in S, then V[ 1, -A] = V[ 1, t, 6]. 
Proof. For any w,z G S, we have an identity: 

(5.8) w 2 -(t 2 -A5)z 2 = {w-t z) 2 + t(w- tz) (2z) + 5 (2z) 2 . 

This implies that the set {x 2 + txy + 5 y 2 : x G S, y G 2S} is equal to the set {w 2 — Az 2 : 
w,z G «S}. The second inclusion in (5.7) follows from this observation. The first inclusion 
follows from the usual "completion of squares" identity: 

(5.9) 4(x 2 + txy + 5y 2 ) = (2x + ty) 2 - (t 2 - A5) y 2 . 

The last conclusion of the Proposition is clear from (5.7). □ 

We are now in a position to extend Theorem 5.1 to the setting of rings. 

Norm Theorem 5.10. (Ring Version) (1) Given t, 5 G S, any n G V[l,t, S] has 
the form -det [X, Y] for some X,Y G M 2 (S') such that tr (X) = t and det (X) = 5. 

(2) For any X = ^ G M 2 (S), let t = tr(X), 5 = det (X), and A = t 2 - A5. For 
any Y G M 2 (S), we have -c 2 det [X,Y] G V[l,t,5], and -4 c 2 det [X,Y] G V[ 1, —A]. 

(3) Keep the notations in (2) above. If t — tr (X) and t' — tr (Y) are both in 2 S, then 
-c 2 det [X,Y] G V[ 1, —A]. 

Proof. (1) follows from the last paragraph in the proof of Theorem 5.1, since the 
construction there works over any commutative ring S. 

(2) After subtracting a scalar matrix from Y, we may assume that Y = ^ . We 

can work generically and thus assume that S is the polynomial ring over Z generated 
by the seven (commuting) variables a, 6, c, d, e, f, g. In this way, c _1 makes sense in the 

/0 c _1 6 \ 

quotient field F of 5. Let X 1 = c _1 (d- a)) G M 2(-^)- Applying formally the 
calculation in the proof of the "if" part of Thm. 5.1, we can write 

(5.11) - det [X u Y] = x 2 + c-\d - a) xy - c^by 2 , 

where x = — (/ — c^bg), and y = e + gc~ 1 (d — a). From these, we have cx, cy G 5. 
Defining X 2 — c • X 1 — ^ aj an< ^ mu ltiplying (5.11) by c 4 , we see that 

(5.12) -c 2 det [X 2 ,Y] = (c 2 x) 2 + (d - a) (c 2 x) (cy) - bc(cy) 2 . 
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Letting a = c 2 x G S and (3 = cy G S, the RHS of (5.12) can be transformed as follows: 



a 2 + {d-a)af3 -bc(3 2 = a 2 + (t - 2 a) a (3 + {5 - ad) f3 2 

= (a- a/3) 2 + ta/3 - a (a + d) (3 2 + 8 (3 2 

= (a-a/3) 2 + t(a-a/3)/3 + 5/3 2 eV[l,t,8]. 

Since [X 2 ,F] = [X 2 + aI 2 ,Y] = [X,Y], this proves the first conclusion in (2). The 
second conclusion follows from this and the first inclusion in (5.7). 

(3) Write t — 2 s and t' = 2 s' (for suitable s,s' G S). Suppose the desired conclusion 
is true for traceless matrices. Then it holds for X := X — s I 2 and Y := Y — s'I 2 ; that 
is, -c 2 det [X ,F ] G V[ 1, 4det (X ) ]. (Note that X has discriminant -4det(X ), 
and the (2, l)-entry of X remains to be c.) To compute det (X ), we use the fact that 
det (X - XI 2 ) = X 2 -t\ + 5. For A = s, this leads to 

(5.13) 4 det (X ) = 4(s 2 -ts + 8) = t 2 -2t 2 + 45 = -(t 2 -45) = -A. 

Therefore, -c 2 det [X ,Y ] G V[l,-A}. Since [X,Y] = [X ,Y ], this proves (3). Start- 
ing afresh, we may thus assume that X — \ J and Y = \ ^ J . In this case, 



c —a y y g — e 

(3) can be proved by checking the explicit equation — c 2 det [X, Y] = P 2 — A Q 2 , where 

(5.14) P = 2a(ag — ce) + c(bg — cf), and Q = ag — ce. 

Since in any case such an equation can be quickly checked by hand or by machine, we 
will not give its detailed derivation here. □ 

Remark 5.15. (A) Note that, in the case where S is a field F, the results in (1) and 
(2) above do retrieve the Norm Theorem 5.1. In fact, in part (2), if the characteristic 
polynomial of the matrix X has a quadratic splitting field Kj F, then the off-diagonal 
entries b, c of X cannot both be zero. If c ^ 0, then the conclusion — c 2 det [X, Y\ G 
V[l,£,5] amounts to —det [X, Y] G \^[1,£,5], since c is invertible, and V[l,£,5] is 
closed under multiplication by squares. If b ^ 0, a simple transposition argument gives 
the same conclusion. But of course, the proof of Thm. 5.10 would not have been possible 
if we had not first worked out the proof of the field-theoretic version Thm. 5.1. 

(B) Since the proof of (5.10) (2) is completely constructive, we can very easily implement 
it and test its accuracy. For instance, let us apply it to the two matrices X, Y in 
Remark 5.4 over the ring S — Z . Here, Y already has the desired form in the proof of 
(5.10)(2), with e = 4 and / = g = 3, while (a, b, c, d) = (0,4,-2,1), with t = 1, 5 = 8, 
A = -31, and -det [X,Y] = 419. We know that 419 £ V[l,l,8] (since 419 is not 
the norm of an algebraic integer in Q (V— 31 )), so the c 2 factor cannot be dropped from 
the first conclusion of (5. 10) (2). On the other hand, following the proof of (5. 10) (2), we 
compute easily that a = —36, and (3 = —5. Since a = and c = —2, this proof predicts 
that c 2 • 419 = 1676 G V[ 1, 1,8], with the equation 1676 = a 2 + a (3 + 8 (3 2 solved by 
(a, (3) = (—36, —5). In view of this and the equation (5.9), the last part of (5.10)(2) also 
predicts that 4 c 2 • 419 = 6704 G V[ 1, 31 ], with the equation 6704 = a 2 , + 31 f3 2 solved 
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by (a ,j3 ) = (2 a + (3,(3) = (—77, —5), as we have already mentioned in Remark 5.4. 
Since c 2 • 419 = 1676 ^ V[ 1, 31 ], this shows that the factor of 4 also cannot be dropped 
from the second conclusion of (5. 10) (2). (We leave it to the reader to check the same 
statement if we had started instead with — det [V, -X"] = 419. Note that (5.10)(3) does 
not apply to either case since tr (X) = 1 and tr (Y) = 4 are not both even ! ) 

We'll close this section with a supplement to Theorem 5.10 in the case where the 
matrix X has a constant diagonal. In this case, we have good control on the values 
of —det [X, Y] without pre- multiplying them by the factor c 2 . The following result is 
not covered by Theorem 5.10, but its proof is straightforward in light of the supertrace 
determinantal formula (applied in its special form in (4. 7) (2)). 

Proposition 5.16. Let X = ^ G R = M 2 (S), and assume that bS + cS = rS 

for some r G S. Then {—det [X, Y] : Y G R} = V[r 2 , —be]. If b, c are coprime in S, 
this set is equal to V[l, —be]. 

~ ( vj x \ 

Proof. We may assume that a = 0, and that Y = 1 . Then, by (4.7) (2) (or by a 



direct computation), we have — det [X, Y\ = (by — ex) 2 — bcw 2 . Since w ranges over S 
and by — cx ranges over the principal ideal r S, these values comprise precisely the set 
V[r 2 , —be] C V[l, —be]. If bS + cS = S, we can take r = 1, in which case the inclusion 
becomes an equality. □ 

§6. Applications to Matrix Factorizations and AfRne Curves 

Continuing the work in §5, we shall give in this section some applications of the 
supertrace determinantal formula (4.5). The main themes of our study will now be 
the factorization of 2 x 2 matrices, and the solution of certain quadratic diophantine 
equations over a commutative ring S. The norm forms of quadratic ring extensions over 
S studied in the last section are the binary quadratic forms x 2 + t xy + 5y 2 , which are 
monic in x. In this section, we shall take up the case of a binary diagonal quadratic 
form px 2 + qy 2 (which is no longer monic in x). In the spirit of the results (5.1) and 
(5.10), we would like to give a "commutator characterization" for the value set of such 
a diagonal form over S; that is, 

(6.1) V[p, q] := {pr\ + qr\ : ri,r 2 eS}. 

The study of these sets is of interest over both rings and fields. For example, V[l,l] 
consists of all sums of two squares in S, and asking if —1 G V\ 1, —d] amounts to solving 
the "negative Pell's equation" x 2 — dy 2 = —1 over S. If F is a field of characteristic 
7^ 2, the criterion for the splitting of the F-quaternion algebra 

(6.2) | i 2 = p, j 2 = q, ij = -ji ) (where p, q G F \ {0}) 

is given by 1 G V[p, q] (see [La: p. 58]). Accordingly, the Hilbert symbol (p,q)F is 
defined to be 1 or —1, depending on whether or not the quadratic form px 2 + qy 2 
represents 1 over F. 
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Using the supertrace determinantal formula (4.5), we are able to provide in Theorem 
6.3 below a matrix-theoretic criterion for a non 0-divisor c G S to belong to V[p, q] over 
a commutative ring S . As far as matrices are concerned, the problem under study here 
is rather distinct from that investigated in the second half of [Ta^] • In the latter, Taussky 
studied the possibility of expressing the matrix A in Theorem 6.3 as a commutator (in 
the case where q = 1), whereas here we are concerned with the factorizations of the 
matrix A with certain commutator properties. 

Factorization Theorem 6.3. Let A = ( ° q J G R = M 2 (S), and let c G S be a 



-p 

non 0-divisor. The following are equivalent: 

(1) ceV[p,q\. 

(2) There exist X,Y E R such that XY = c • A, det (A) = cp, det (Y) = cq, 
and det [X, Y] = -c 2 . 

(3) There exist A 1; Y 1 G R such that X 1 Y 1 = c ■ A, det (Ax) = cq, det (Yi) = cp, 
and det [Ax,Yx] = — c 2 . 

Proof. (3) =>- (1). Let X 1: Yx G R be as in (3). If their supertraces are r and s, 
then (since AxYx = c- A has zero diagonal) Cor. 4.7(1) gives c 2 = — det [X^l^] = 
(eg) s 2 + (cp) r 2 . Cancelling c, we get c = pr 2 + qs 2 G V[p, 5]. 

(1) =^> (2). (This implication does not require c to be a non 0-divisor.) If c e V[p, 9], 
write c = pr 2 + gs 2 for some r,s G 5, and let a = s + pr, b = r — qs. Then 
ar — bs = pr 2 + gs 2 = c. For the matrices 

(6.4) X=f° 6 V and Y=( b qT 

\ps pr J \—a —qs 

we have det (X) = (ar — bs)p = cp, and det (Y) = (ar — bs) q = cq. Also, 

(6.5) XY=( (h ° . S^-teA^ oq\ =c A 
v 7 \p(bs — ar) y cp y 

Since str (A) = a — pr = s and str (Y) = b + qs = r, Cor. (4.7) (1) gives 

(6.6) det [A, Y] = -(cp)r 2 - (c q) s 2 = -c 2 . 

(2) =^ (3). (This implication also does not require c to be a non 0-divisor.) Given A, Y 

as in (2), let X\ — Y' and Y\ = —A' (where the primes denote the adjoints). Then 
det (Ai) = det (Y') = det (Y) = cq, and similarly det (Yi) = det (A) = cp. Moreover, 
X{Yi = —Y'X' = -(XY)' = -c(-A) = c- A. Finally, 

[Xi, Yi] = [Y',-X'} = -Y'X' + X'Y' 
= -(XY)' + (YX)' = -[X,Y]' 

implies that det [Ax,^] = det [X,Y] = —c 2 , as desired. □ 

Remark 6.7. (A) In the case where both c and p are non 0-divisors, the condition 
in (2) that det (Y) = cq could have been dropped, since it would have followed from 
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XY = c ■ A and det (X) = cp. However, the present form of the statement in (2) is 
more symmetrical. (The same remark can be made about the statement (3).) 

(B) The implications (3) =4> (1) and (2) =^ (1) in Thm. 6.3 need not hold if c G S is 
a 0-divisor. For instance, let S be the commutative local Q-algebra generated by x, y 
with the relations x 2 = y 2 = xy = 0, and let c — x, p = q = y. Then (2) and (3) are 
trivially satisfied by the matrices X = Y = X 1 = Y 1 = 0. However, c = x V[y, y}. 

By further developing the ideas used in the proof of the implication (1) =>- (2) 
above, we get also the following unexpected algebro- geometric result on affine curves 
over commutative rings. 

Theorem 6.8. Given p,q,c G S, let C = C p ^ c be the plane conic {(r, s) G S 2 : 
pr 2 + qs 2 = c}. Let Q = Q c be the quadric surface {(x,y, z) : xy — z 2 = —c 2 }, and let 
P — Pp,q,c be the "vertical plane" {(x,y,z) : px + qy = —c} (both in S 3 ). Then there 
is an affine morphism f : C —> P fl Q defined by 

(6.9) /(r, s) = (r [2qs — r), — s (2pr + s), rs + pr 2 — qs 2 ) (V(r, s)gC). 

Proof. Before proceeding with the proof, note that the conic C is nonempty iff c G 
V[p,q]. In the case C = 0, of course, the statement of the theorem is vacuous. In the 
following, we may thus assume that C ^ 0. 

Given any point (r, s) G C (that is, with pr 2 + qs 2 = c), let us use the notations 
and conclusions in the proof of (1) =^> (2) in Thm. 6.3 (recalling that this implication 
did not require c to be a non 0-divisor in S). Since M := [X, Y] has trace zero, it can 

— z x\ 

1 (for some x,y, z G S). To compute this matrix, 
we use the definitions of X, Y in (6.4) (and the fact that XY = c ■ A ) : 



M = XY-YX = 




Recalling that a = s + pr and b = r — qs, we have 

{x = q 2 s 2 — b 2 = (qs + b) (qs — b)—r (2qs — r), 
y — p2 r .2 _ a 2 _ _ ^ ^ r _|_ a ^ _ _ s (2pr + s), 
z = ab + pqrs = (s + pr) (r — qs) + pqrs = rs + pr 2 — qs 2 . 

These are quadratic forms in {r, s} (if we think of {p, q} as constants), which define an 
affine morphism / from C to S 3 , with the obvious property that f(—r, —s) — f(r, s). 
Furthermore, the fact (from (6.3) (2)) that — c 2 = det [X, Y] = xy — z 2 implies that 
f(C) C Q. Finally, for x,y G S as defined above, we have 

(6.11) px + qy = pr (2qs — r) — qs (2pr + s) = —(pr 2 + qs 2 ) = — c, 

so we have /(C) C P also, as desired. □ 
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Remark 6.12. Some congruence properties of the values of x, y, z are note-worthy. For 
any (r, s) G C, (6.10) clearly implies that x = — r 2 (mod 2q), and y = — s 2 (mod 2p). 
As for 2, we can rewrite it as follows: 

(6.13) z = rs + (c — gs 2 ) — gs 2 = s (r — 2gs) + c. 

We did not use this expression for z in (6.10) since it is not symmetrical in p and 
q (and also not homogeneous in r and s). However, this new expression does give 
some additional information on z; that is, z = c (mod s). Similarly, we can write 
z = r (s + 2pr) — c, so z = — c (mod r) as well. 

To make the meaning of Thm. 6.8 more explicit from the viewpoint of arithmetic 
geometry, it is best to work in the case S = Z. In this case, pr 2 + qs 2 — c defines a 
conic C C C 2 , px + qy = —c defines a "vertical plane" P C C 3 , while xy — z 2 = — c 2 
defines a quadric surface Q C C 3 . The map / : C — > P(~)Q given by the polynomials in 
(6.10) is then an affine morphism defined over Z, taking integer points to integer points. 
Furthermore, the ring Z can be replaced throughout by an arbitrary ring of algebraic 
integers. 

Example 6.14. Over S — Z , let p — —3, q — 8, and c = 5. Obviously, all four points 
(±1, ±1) are on the conic C. Using (6.10), we compute easily that 

(6.15) /(l, 1) = /(-l, -1) = (15, 5, -10), /(l, -1) = /(-l, 1) = (-17, -7, -12), 

which all lie on the curve P f]Q. However, the map f : C ^ P (~) Q is not surjective 
in this example. For instance, we claim that (15, 5, 10) e PnQ is not in f(C). To see 
this, assume for the moment that /(r, s) = (15, 5, 10) for some (r, s) G C . By Remark 
6.12, we must have 10 = 5 (mod s), so s divides 5. If s = ±5, then — 3r 2 + 8s 2 = 5 
leads to a quick contradiction. Thus, s — ±1, and hence also r — ±1. But according to 

(6.15) , the z-coordinate of /(r, s) must then be either —10 or —12, a contradiction. (In 
(6.18)(C) below, we'll actually give some examples where C = 0, but P C\Q ^ To 
test the accuracy of the formulas (6.10), it is worthwhile to compute a few more image 
points for the map /. For instance, for (±3, 2) on the conic C, we have 

(6.16) /(3, 2) = (87, 32, -53), and /(-3, 2) = (-105, -40, -65), 

which are indeed points in PnQ. (Recall that the functions x, y, z grow quadratically 
with respect to the two variables r and s.) 

We record the following consequence of Theorem 6.8, since we cannot locate a refer- 
ence for it (or for any similar result) in the literature. 

Corollary 6.17. For any p,q G S, we have the following: 

(1) If c G V[p, q], there exist x,y,z G S such that px + qy = — c and xy — z 2 = —c 2 . 
(There also exist Xi,yi,Zi G S such that px\ + qy\ = c and x±yi — z\ = — c 2 .) 

(2) If V[p,q] contains a unit of S, then there exist x 2 ,y2,Z2 £ S such that 
px 2 + qy 2 = x 2 y 2 ~4 = - 1 - 
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Proof. (1) follows from Thm. 6.8, and the parenthetical statement follows from the 
main statement by taking (xi,yi,zi) to be (—x,—y,±z). 

(2) Fix a unit c £ V[p, q], and take x,y,z as in (1). Then x 2 = c _1 x, y 2 = c^y, and 
z 2 = c~ x z satisfy the required conditions. □ 

Remark 6.18. (A) Note that, in (1) above, we cannot say that "there exist x 3 ,y 3 , z 3 £ 
S such that px3 + qy% = c and X3V3 — z 2 = c 2 ." Indeed, for S = Z , take p = —4, 
q = 13, and c = 1. We have c £ V"[p, g] since 1 = — 4r 2 + 13s 2 for (r, s) = (9,5). 
However, using standard software for solving binary quadratic equations (such as [Alp]), 
we can easily check that there do not exist integers x 3 , y 3 , z 3 such that — 4x 3 + 13y 3 = 
xm - 4 = 1 - 

(B) For S = Z, the following numerical example shows that, in case V[p, q] contains a 
unit, say 1, the representation of 1 in the form pr 2 + qs 2 may involve very large integers 
r and s, even though p, q are pretty small. For instance, let p = 37 and q = —67. 
Then 1 £ V[p, g] according to [Alp], but the smallest solution for 37r 2 — 67s 2 = 1 is 

(6.19) r = 264,638,639,242, and s = 196,660,308,201. 

Confirming our result in Cor. 6.17, [Alp] showed that indeed solutions exist for the 
equations 37 x 2 — 67 y 2 = x 2 y 2 — z 2 = — 1- However, the numbers x 2 ,y 2 ,z 2 have at least 
19 digits! Of course, the specific solution (x,y, z) constructed from (6.10) by using the 
point (r, s) in (6.19) is even larger. 

(C) We should also point out that the converse to the main statement in (6.17)(1) is 
not true in general. For instance, let S — Z again, and take c = 1. For any p > 1 and 
q = p + 1, the equations px + qy = xy — z 2 = —1 are solved by (x, y, z) = (1, —1, 0), 
but obviously ±1 ^ V[p,p+ 1]. It is, however, possibly more interesting to give an 
example where ±1 £ V[p, q] is not simply ruled out "by absolute values". For this, we 
can take, for instance, p = —8, q = 13, for which the equations px + qy = xy — z 2 = —1 
are solved by (x,y,z) = (5,3,4). Nevertheless, ±1 ^ V[p, q], in view of the fact that 
±13 (or ±5) is not a square modulo 8. 

In [KLx], two of the authors study the problem of factorizing a matrix A into a 
product XY in such a way that the commutator [X, Y] is invertible. The matrices A 
that admit such a factorization are said to be reflectable. For some applications of the 
results (6.3), (6.8), and (6.17) in this section to the study of reflectable matrices over 
commutative rings, see §5 in [KLx]. 
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